Distance Measures for Detecting Geo-Social Similarity

نویسندگان

  • Yaron Kanza
  • Elad Kravi
  • Eliyahu Safra
  • Yehoshua Sagiv
چکیده

This paper investigates the problem of geo-social similarity among users of online social networks, based on the locations of their activities (e.g., posting messages or photographs). Finding pairs of geo-socially similar users or detecting that two sets of locations (of activities) belong to the same user has important applications in privacy protection, recommendation systems, urban planning, public health, etc. It is explained and shown empirically that common distance measures between sets of locations are inadequate for determining geo-social similarity. Two novel distance measures between sets of locations are introduced. One is the mutually nearest distance that is based on computing a matching between two sets. The second measure uses a quad-tree index. It is highly scalable, but incurs the overhead of creating and maintaining the index. Algorithms with optimization techniques are developed for computing the two distance measures and also for finding the k-most similar users of a given one. Extensive experiments, using geo-tagged messages from Twitter, show that the new distance measures are both more accurate and more efficient than existing ones.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

New distance and similarity measures for hesitant fuzzy soft sets

The hesitant fuzzy soft set (HFSS), as a combination of hesitant fuzzy and soft sets, is regarded as a useful tool for dealing with the uncertainty and ambiguity of real-world problems. In HFSSs, each element is defined in terms of several parameters with arbitrary membership degrees. In addition, distance and similarity measures are considered as the important tools in different areas such as ...

متن کامل

An Empirical Comparison of Distance Measures for Multivariate Time Series Clustering

Multivariate time series (MTS) data are ubiquitous in science and daily life, and how to measure their similarity is a core part of MTS analyzing process. Many of the research efforts in this context have focused on proposing novel similarity measures for the underlying data. However, with the countless techniques to estimate similarity between MTS, this field suffers from a lack of comparative...

متن کامل

A DL-based Approach for Detecting Semantic Relations in Geo- Ontology Matching

Ontology matching produces mapping relations between elements of two ontologies and it is a basic problem in geographical information integration. Currently, most existing studies rely on all kinds of semantic similarity between the semantic entities to measure ontology mapping relations. However, these measures are not sufficient due to only detecting equivalence relation of compared ontologie...

متن کامل

A Systematic Survey of Point Set Distance Measures for Link Discovery

Large amounts of geo-spatial information have been made available with the growth of the Web of Data. While discovering links between resources on the Web of Data has been shown to be a demanding task, discovering links between geo-spatial resources proves to be even more challenging. This is partly due to the resources being described by the means of vector geometry. Especially, discrepancies ...

متن کامل

A revised Fuzzy - PROMETHEE method , using Fuzzy Distance and Similarity Measures

PROMETHEE refers to a collection of methods of ranking in the field of multi-criteria decision making. These methods are characterized by conceptual simplicity and practical applicability. However, the nature of phenomena involving decision-making in real world leads us to use fuzzy method of preference ranking. The most common criticism on mathematical ranking procedures is that they tend to d...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017